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This article provides a brief discussion of the historical context for the major recent 
advances in enlisted selection and classification that are the subject of this special issue. 
By providing a perspective on the difficulty personality assessment has had gaining 
traction as a screening measure, it supplies a basis for understanding the significance of 
recent advances in personality measurement. This article also summarizes the military 
research on interest measurement that will receive more extended treatment later in this 
issue. It relates the work in this issue on new information technology testing to the role 
of information testing in the current military operational selection and classification test 
battery, the Armed Services Vocational Aptitude Battery (ASVAB), and discusses the 
background for 2 additional cognitive measures, Coding Speed and Assembling Ob- 
jects. Finally, it previews issues to be discussed in the commentary article at the end of 
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Since 1917, with the development of the 
Army Alpha and Beta, the military services 
have been at the forefront of selection and 
classification research and practice. The his- 
tory of military enlistment testing has been 
marked by several important developments 
since the implementation of these early tests, 
including the gradual evolution from mea- 
sures of general cognitive ability to more 
differentiated multiple aptitude measures de- 
signed to serve classification as well as selec- 
tion, the origination of a single joint-service 
battery, the Armed Services Vocational Apti- 
tude Battery (ASVAB) to replace those de- 
veloped separately by the Army, Navy, and 
Air Force, the transition from an exclusively 
paper and pencil format to the inclusion of a 
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computerized adaptive format, the validation 
of the joint-service battery against perfor- 
mance in each of the services, and the addi- 
tion of a spatial test to this battery (Defense 
Manpower Data Center, 2012; Defense Man- 
power Data Center, n.d.; Rumsey, 2012). 

As the 20th century gave way to the 2Ist, a 
new revolution in military testing could be 
observed. Noncognitive tests, which had pre- 
viously played at best a secondary role in 
such testing, were now a matter of keen in- 
terest. New content was envisioned, espe- 
cially with respect to the increased focus on 
information technology in military jobs. The 
need to reconsider existing content and test- 
ing methodology also came into focus. These 
themes were given prominence by an ASVAB 
Review Panel, established to provide recom- 
mendations concerning military testing (Dras- 
gow, Embretson, Kyllonen, & Schmitt, 
2006). 

Complete treatment of all these changes is 
beyond the scope of this special issue. This 
volume highlights new noncognitive tests and 
their implementation in multiple screening 
contexts, a new information and communica- 
tions technology literacy test, a perceptual 
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speed and accuracy test, and a spatial ability 
test. 


Noncognitive Tests 


Two types of noncognitive measures have 
received particular attention in the history of 
military enlistment testing. One type might be 
referred to as personality, although measure- 
ment of personality constructs may be indirect 
and even unintentional. For example, back- 
ground and interest measures may reveal as- 
pects of an individual’s personality even if the 
relevant personality constructs are not specified. 
The terms personality and temperament are of- 
ten used interchangeably in the military litera- 
ture. For a time, the term “personality” was 
avoided in the context of Army selection and 
classification testing research to preclude con- 
fusion with more clinically oriented measures 
often identified as personality tests. 

The second type is vocational interests, defined 
by Van Iddekinge, Putka, and Campbell (2011, p. 
14) as “relatively stable individual differences that 
influence behavior through preferences for certain 
work activities and work environments.” The dis- 
tinction between vocational interest testing and 
personality testing is often difficult to draw, as 
interests may be either a reflection of an individ- 
ual’s personality or vocational preference. Some 
tests may have elements of both types. However, 
in recent decades the distinction has been more 
consciously drawn, so we will treat the types sep- 
arately even while acknowledging the overlap. 


Personality Testing 


Although interest in noncognitive correlates of 
officer performance can be traced to the use of 
situation tests in Germany as early as 1927 (Rum- 
sey & White, 2010), little attention was given to 
personality testing for enlistment selection before 
the 1950s. Then, measures that “captured aspects 
of personality such as self-confidence, self- 
assertiveness, emotional stability, leadership, and 
devotion to group objectives” (Greenston, 2012) 
were added to the Army battery used for classify- 
ing soldiers into jobs, the Army Classification 
Battery. Each of the services developed back- 
ground measures for enlisted assessment in the 
1970s—the Air Force’s History Opinion Inven- 
tory (HOD, the Navy’s Recruiting Background 
Questionnaire (RBQ), and the Army’s Military 


Applicant Profile (MAP) (Rumsey & White, 
2010). However, when the services combined 
their efforts to implement the joint-service 
ASVAB in 1976, it contained only cognitive 
measures (Rumsey, 2012). 

The value of personality measures for selection 
has been questioned during much of the history of 
standardized testing. A review of available re- 
search (Guion & Gottier, 1965) that summarily 
dismissed the value of personality measures in 
personnel selection reflected the general attitude 
toward such measures at the time. A more prom- 
ising result was obtained with the large scale 
Army selection and classification project initiated 
in the 1980s known as Project A. Using a thought- 
ful and thorough development process involving 
the identification of potentially relevant personal- 
ity dimensions through a comprehensive literature 
review, an expert judgment task focused on ex- 
pected validity of each dimension (Peterson & 
Wing, 2001), and large-scale pretesting, research- 
ers constructed an instrument known as the As- 
sessment of Background and Life Experiences 
(ABLE; Hough, Barge, & Kamp, 2001). The va- 
lidity of the ABLE was determined based on two 
types of performance criteria—(a) task profi- 
ciency (“can do” performance) and (b) a combi- 
nation of ratings and administrative measures that 
were determined to represent the extent to which 
the individual performed over time (“will do” 
performance). The ABLE added little to the pre- 
diction of proficiency, but contributed much to the 
prediction of the will do criteria (Oppler, McCloy, 
Peterson, Russell, & Campbell, 2001). 

ABLE was also found to be an effective 
predictor of premature separation from the 
Amy (i.e., attrition) (White, Young, & Rum- 
sey, 2001). A joint-service measure drawn from 
previous research by each of the services, the 
Armed Services Applicant Profile (ASAP), was 
also found to be an effective predictor of mili- 
tary attrition (Trent, 1993). The Department of 
Defense (DoD) saw the potential for use of a 
measure that combined the ABLE and ASAP 
for enlisted screening, the Adaptability Screen- 
ing Profile (ASP), but implementation was 
thwarted by a concern that when used opera- 
tionally, applicants’ motivation to present them- 
selves positively rather than accurately, often 
pejoratively and unsympathetically referred to 
as “faking,” would undermine the utility of the 
measure (Trent, 1993). 
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Strategies to control faking then became a ma- 
jor focus of military research on enlistment per- 
sonality tests. Fan et al. (2012) identified three 
paradigms for faking mitigation—reactive, pre- 
ventative, and a combination of the first two. The 
Army’s effort to develop a faking resistant attri- 
tion screen focused on prevention, through a mul- 
tidimensional forced choice strategy. The ratio- 
nale for this approach, the manner in which it was 
translated into a measurement instrument, how the 
Army perceived its screening potential, how it was 
combined with other measures to form what be- 
came known as the Tier Two Attrition Screen 
(TTAS), and further developments in attrition 
screening, constitute the content of the next article 
in this special issue, by White, Rumsey, Mullins, 
Nye, and LaPort (2014, pp. 138-152). 

The Army’s success in developing and imple- 
menting a personality screen for the prediction of 
attrition stimulated interest in wider use of person- 
ality measures in the enlistment screening process. 
The TTAS was implemented only for screening 
nonhigh school diploma graduates, who by policy 
constitute only a small portion of military enlist- 
ments (Young & White, 2006). Why not, Army 
leaders asked, use personality measures in screen- 
ing at least a portion of the high school diploma 
graduates? This interest was accompanied by a 
new development in personality measurement. A 
group of researchers from the Drasgow Consult- 
ing Group (DCG) were exploring the viability of 
computer adaptive personality testing (e.g., Stark, 
Chernyshenko, & Drasgow, 2005). Computer 
adaptive testing could offer efficiencies that paper 
and pencil testing could not match, but the ap- 
proach as applied to personality measurement pre- 
sented such conceptual and psychometric chal- 
lenges that its adoption had hardly been 
contemplated before the DCG researchers began 
their work. After the article on attrition screening, 
Stark et al., (2014, pp. 153-164) will describe how 
the DCG work for the Army led to the develop- 
ment and experimental implementation of the Tier 
One Performance Screen, and how a separate 
DCG effort resulted in an experimental Navy 
computer adaptive screening instrument. 


Vocational Interest Testing 


Vocational interest testing focuses on the fit 
between individual interests and the environ- 
ment and tasks associated with a particular job, 
as well as the congruence of interests between 


one potential or actual job incumbent and the 
other incumbents. The premise behind such 
testing is that the better the fit, the greater the 
likelihood that the individual will be satisfied 
with the job, leading to greater retention and 
possibly even better performance. 

The first significant work on an interest mea- 
sure designed for enlisted military applicants 
was the Navy’s development of the Navy Vo- 
cational Interest Inventory (NVII), a process 
that began in 1946 and was completed in 1961. 
Existing commercial instruments, designed for 
those seeking careers requiring a college edu- 
cation, were deemed inappropriate for those 
who would be seeking the types of blue collar 
jobs available to enlisted service members. 
Thus, the NVII was built from the ground up, 
based on activities that reflected divergent in- 
terests between members in different Navy oc- 
cupations (Clark, 1955, 1961). Validation re- 
search indicated that, when the jobs of Navy 
enlistees were aligned with their interests, these 
enlistees were indeed more likely to reenlist and 
perform better (Lau, 1969). 

The Air Force followed with the develop- 
ment of the Vocational Interest Career Exami- 
nation (VOICE) in the early 1980s. Like the 
Navy, the Air Force eschewed existing com- 
mercial instruments, developing items linked to 
Air Force jobs. The VOICE consisted of four 
types of items: occupational titles, work tasks, 
leisure time activities, and desired learning ex- 
periences. Respondents indicated preferences in 
terms of like, indifferent, or dislike. The mea- 
sure was found to be a good predictor of job 
satisfaction (Alley & Matthews, 1982). 

A few years later, when the Army desired to 
test a vocational interest inventory in Project A, 
it used the VOICE as a starting point. The 
Army’s version, named the Army Vocational 
Interest Career Examination (AVOICE), was 
refined and revised to meet Army needs. The 
AVOICE has been found to predict perfor- 
mance and provide results that are consistent 
with one’s current military occupational spe- 
cialty (MOS) (Hanson, Paullin, Bruskiewicz, & 
White, 2003). 

The DoD also initiated a program of interest 
assessment as part of its Career Exploration Pro- 
gram (CEP). This program, initiated in 1992, in- 
volves administration of both the ASVAB and an 
interest measure to high school and college stu- 
dents to both provide information to help them 
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explore career options and to identify qualified 
leads for service recruiters. Initially, the interest 
inventory used in the CEP was one already avail- 
able, the Self Directed Search (SDS; Holland, 
1985). This instrument was based on Holland’s 
vocational personality theory, which determined 
that person-job match could be based on six in- 
terest categories collectively named RIASEC: Re- 
alistic (R), Investigative (I), Artistic (A), Enter- 
prise (E), and Conventional (C) (Wall, Wise, & 
Baker, 1996). 

The SDS was then used to validate the DoD 
instrument, the Interest-Finder, in the CEP. A 
revised interest inventory, Find Your Interests, 
is used in the current ASVAB CEP. DoD de- 
cided to develop its own instrument to better 
focus on the youth population it was testing. 
Although it continued to use the RIASEC 
model, it defined its own content areas and 
types of items for the Interest-Finder, which 
was introduced in 1995 (Wall, Wise, & Baker, 
1996). In recent years, the Army has been ex- 
perimenting with its own RIASEC-based mea- 
sure of interest in a variety of work activities, 
the Work Preferences Survey (WPS), which has 
been found to relate well to a variety of perfor- 
mance and attitudinal criteria (Putka & Van 
Iddekinge, 2007; Van Iddekinge et al., 2011). 

For several years, the Navy has also been 
developing a new computerized interest inven- 
tory, known as Jobs and Occupational Interest 
in the Navy (JOIN). Components of this inven- 
tory include work areas, work styles, work en- 
vironments, and work activities. A key charac- 
teristic of this inventory involves the use of 
pictures to represent elements of each compo- 
nent (Farmer, Watson, Alderton, Michael, & 
Hindelang, 2006). Ingerick and Rumsey (2014, 
pp. 165-181) describe these developments in 
interest measurement for those in an enlistment- 
eligible population in a separate article in this 
special issue. 


Cognitive Testing 
Information Technology 


In the earliest days of military testing, the 
focus was on general aptitude, not on specific 
capabilities that might translate to differential 
performance across jobs. However, as the cred- 
ibility of military testing grew, so did the inter- 
est in building tests that could be effective in job 


placement. As the services began to build clas- 
sification batteries, they focused particularly on 
information tests that might indicate an individ- 
ual’s interest and aptitude with respect to par- 
ticular subject areas. The current ASVAB has 
several information tests designed principally 
for use in person-job match. As the ability to 
receive, transmit, and interpret computerized 
information began to play an increasingly 
greater role in the success of military opera- 
tions, the lack of a measure specifically focused 
on such ability has become a major concern 
among the services. The Air Force was given 
the lead on developing such a measure, with the 
other services also participating in its valida- 
tion. Therefore, another article in this special 
issue, by Trippe, Moriarty, Russell, Carretta, 
and Beatty (2014, pp. 182-198) will deal with 
the research that has been conducted on this 
instrument. 


Perceptual Speed and Accuracy 


Before 2002, the ASVAB contained two 
speeded tests, Numerical Operations and Cod- 
ing Speed. These tests provided coverage of one 
of the factors identified in the ASVAB, Clerical 
Speed (Alderton, Wolfe, & Larson, 1997), and 
aided in classification. However, it was difficult 
to maintain scoring consistency for these tests 
across diverse testing conditions, and they were 
discontinued as Joint-Service tests. The Navy 
alone continued to administer Coding Speed as 
a special test and has amassed evidence support- 
ing its reintroduction as part of the Joint-Service 
testing battery. This research is discussed in this 
issue by Held, Carretta, and Rumsey (2014, pp. 
199-220). 


Spatial Ability 


The ASVAB originally had a measure of 
spatial ability, Space Perception (Maier, 1993), 
although this test was later dropped. In Project 
A, spatial tests were found to provide incremen- 
tal validity over the ASVAB in predicting task 
performance in Army jobs. Further supportive 
evidence of the value of spatial tests in predict- 
ing job performance in the military led to the 
decision to include a measure of spatial ability, 
Assembling Objects, in the ASVAB. The article 
by Held and her co-authors referred to above 
discusses research recently conducted support- 


e of its allied publishers. 


ghted by the American Psychological Association or on 


This document is copyri 
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly. 


INTRODUCTION 135 


ing the use of Assembling Objects for enlisted 
classification. 


Issues and Implications 


The research described in this special issue 
carries with it a number of concerns and impli- 
cations, which will be discussed in the Com- 
mentary, the final article. Perhaps the most sig- 
nificant issue is whether this research is on the 
right track, with respect to such questions as: 


1. Is it addressing those attributes that, be- 
yond those already being measured, are 
most important for success in the military 
environment? Does the research indicate 
that an approach combining cognitive and 
noncognitive measures is likely to provide 
greater benefits to the military than one 
focused only on cognitive aptitudes? Is the 
research focusing on the right noncogni- 
tive attributes and is it measuring them in 
the most effective way? With respect to 
cognitive testing, is there any benefit to be 
gained by adding another information or 
specific aptitude test? Will this just 
amount to another measure of general ap- 
titude that is so highly correlated with the 
current operational ASVAB tests that it 
adds nothing, or will it identify those who, 
through their understanding and inclina- 
tion, are uniquely qualified to perform vi- 
tal tasks in certain jobs? 

2. Is it using the most appropriate criteria to 
determine effectiveness? Historically, two 
types of outcomes have been of greatest 
interest in military selection and classifi- 
cation research: retention (defined here as 
continuation both within and beyond 
one’s initial obligated tour of service) and 
performance. Project A demonstrated that 
there are two principal classes of perfor- 
mance that must both be considered if one 
is to have a complete picture of an indi- 
vidual’s benefit to the organization. One 
has to do with job proficiency, the other 
with the motivation to perform effectively 
over time (Rumsey, Peterson, Oppler, & 
Campbell, 1996). In the commentary, the 
information provided by the criteria used 
in each investigation will be examined. 


The commentary will also consider the con- 
tributions made by this research to military se- 
lection and classification and to psychological 
research in general. It will examine the extent to 
which new measures described in this special 
issue have been implemented and the lessons 
that can be drawn from this recent history. It 
will also examine how these research activities 
have contributed to the science of attribute mea- 
surement for selection and classification. Mili- 
tary testing has always been both a consumer of, 
and contributor to, the broader field of person- 
nel testing. 

Finally, the commentary will consider issues 
for future research. It will consider neglected 
areas in the realm of individual differences that 
might be primary candidates for future research. 
It will suggest that pessimism regarding the 
potential classification gains possible with ex- 
isting measures is misplaced, but will note that 
the lack of available cost-effective performance 
measures is a particular handicap for research 
needed to (a) assess continuously the value of 
current measures as job requirements and the 
operational military environment changes, and 
(b) examine the potential value of new measures 
for classification purposes. Thus, research to 
both improve performance measurement and to 
enhance tools for estimating validity for jobs 
lacking performance measures should be impor- 
tant future priorities. 
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